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O ■ Abstract 

o 

ps| ■ We examine several online matching problems, with applications to Internet advertising reservation 

systems. Consider an edge-weighted bipartite graph G, with partite sets L, R. We develop an 8-competitive 
algorithm for the following secretary problem: Initially given R, and the size of L, the algorithm receives 
the vertices of L sequentially, in a random order. When a vertex / e L is seen, all edges incident to I are 
t ■ revealed, together with their weights. The algorithm must immediately either match I to an available vertex 

of R, or decide that I will remain unmatched. 

^y^ • In im, the authors show a 16-competitive algorithm for the transversal matroid secretary problem, which 

^y , [ is the special case with weights on vertices, not edges. (Equivalently, one may assume that for each I E L, 

the weights on all edges incident to / are identical.) We use a similar algorithm, but simplify and improve 

Q [ the analysis to obtain a better competitive ratio for the more general problem. Perhaps of more interest is 

the fact that our analysis is easily extended to obtain competitive algorithms for similar problems, such as to 

find disjoint sets of edges in hypergraphs where edges arrive online. We also introduce secretary problems 

^ ■ with adversarially chosen groups. 

Q^ I Finally, we give a 2e-competitive algorithm for the secretary problem on graphic matroids, where, with 

CO ■ edges appearing online, the goal is to find a maximum-weight acyclic subgraph of a given graph. 

o 

00 

O 

> 

X 



*Dept. of Computer Science, University of Illinois, Urbana, IL 61801. This work was done while the author was at Google Inc., 

NY. nkorula2@uiuc . edu 

^Google Inc., 76 9th Avenue, New York, NY 10011. mpalSgoogle . com 



1 Introduction 

Many optimization problems of interest can be phrased as picking a maximum-weight independent subset from 
a ground set of elements, for a suitable definition of independence. A well-known example is the (Maximum- 
weight) Independent Set problem on graphs, where we wish to find a set of vertices, no two of which are 
adjacent. A more tractable problem in this setting is the Maximum- weight Matching problem, in which we 
wish to find a set of edges such that no two edges share an endpoint. This notion of independence can be 
naturally extended to hypergraphs, where a set of hyperedges is considered independent if no two hyperedges 
share a vertex. 

In the previous examples, independent sets are characterized by forbidding certain pairs of elements from 
the ground set. A somewhat related, but different notion of independence comes from the independent sets of 
a matroid. For example, in the uniform matroid of rank k, any set of at most k elements is independent. For 
graphic matroids, a set of edges in an undirected graph is independent if and only if it does not contain a cycle; 
the optimization goal is to find a maximum-weight acyclic subgraph of a graph G. In transversal matroids, a 
set of left- vertices of a bipartite graph is independent if and only if there is a matching that matches each vertex 
in this set to some right-vertex. 

In many applications, the elements of the ground set and their weights are not known in advance, but arrive 
online one at a time. When an item arrives, we must immediately decide to either irrevocably accept it into the 
final solution, or reject it and never be able to go back to it again. We will be interested in competitive analysis, 
that is, comparing the performance of an online algorithm to an optimal offline algorithm which is given the 
whole input in advance. In this setting, even simple problems like selecting a maximum-weight element become 
difficult, because we do not know if elements that come in the future will have weight significantly higher or 
lower than the element currently under consideration. If we make no assumptions about the input, any algorithm 
can be fooled into performing arbitrarily poorly by ofering it a medium-weight item, followed by a high-weight 
item if it accepts, and a low-weight item if it rejects. To solve such problems, which frequently arise in practice, 
various assumptions are made. For instance, one might assume that weights are all drawn from a known 
distribution, or (if independent sets may contain several elements) that the weight of any single element is small 
compared to the weight of the best independent set. 

One useful assumption that can be made is that the elements of the ground set appear in a random order. 
The basic problem in which the goal is to select the maximum-weight element is well known as the Secretary 
Problem. It was first published by Martin Gardner in [6|, though it appears to have arisen as folklore a decade 
previously Q. An optimal solution is to observe the first n/e elements, and select the first element from the 
rest with weight greater than the heaviest element seen in the first set; this algorithm gives a 1/e probability of 
finding the heaviest element, and has been attributed to several authors (see [51). 

Motivated by this simple observation, several results have appeared for more complex problems in this 
random permutation model; these are often called secretary-type problems. Typically, given a random permu- 
tation of elements appearing in an online fashion, the goal is to find a maximum-weight independent set. For 
example, Kleinberg |7| gives a 1 + 0(l/\/A;)-competitive algorithm for the problem of selecting at most k 
elements from the set to maximize their sum. Babaioff et al. |2| give a constant-competitive algorithm for the 
more general Knapsack secretary problem, in which each element has a size and weight, and the goal is to find 
a maximum-weight set of elements whose total size is at most a given integer B. 

Babaioff et al. HI had earlier introduced the so-called matroid secretary problem, and gave an Oilogk)- 
competitive algorithm to find the max- weight independent set of elements, where k is the rank of the underlying 
matroid. A 16-competitive algorithm was also given in [1] for the special case of graphic matroids; this was 
based on their 4d-competitive algorithm algorithm for the important case of transversal matroids, where d is 
the maximum degree of any left- vertex. Recently, Dimitrov and Plaxton H improved the latter to a ratio of 16 
for all transversal matroids. A significant open question is whether there exists a 0(l)-competitive algorithm 
for general matroids, or for other secretary problems with non-matroid constraints. 



These secretary-type problems arise in many practical situations where decisions must be made in real-time 
without knowledge of the future, or with very limited knowledge. For example, a factory needs to decide which 
orders to fulfil, without knowing whether more valuable orders will be placed later. Buyers and sellers of houses 
must decide whether to go through with a transaction, though they may receive a better offer in a week or a 
month. Below, we give an example from online advertising systems, which we use as a recurring motivation 
through the paper. 

Internet-based systems are now being used to sell advertising space in other media, such as newspapers, 
radio and television broadcasts, etc. Advertisers in these media typically plan advertising campaigns and reserve 
slots well in advance to coincide with product launches, peak shopping seasons, or other events. In such 
situations, it is unreasonable to run an auction immediately before the event to determine which ads are shown, 
as is done for sponsored search and other online advertising. 

Consider an automatic advertising reservation system, in which the seller controls a number of slots, each 
representing a position in which an advertisement (hereafter ad) can be published. Advertisers/Bidders appear 
periodically, and report which slots they would like to place an ad in, and how much they are willing to pay for 
each slot. When an advertiser reports a bid, the system must immediately decide whether or not to accept it; if a 
bid is accepted, the ad must be placed in the corresponding slot, and if not, the ad is permanently rejected. Note 
that in disallowing the removal of an accepted ad, our model differs significantly from that of [3 1, in which the 
seller can subsequently remove an accepted ad if he makes a compensatory payment to the advertiser. 

We model this system as as an online edge-weighted matching problem on a bipartite graph G{L U R,E): 
the vertices of set R correspond to the set of slots, and those of set L to the ads. For each vertex / G L, its 
neighbors in R correspond to the slots in which ad / can appear, and the weight of edge {I, r) is the amount 
the advertiser is willing to pay if / appears in slot r. Initially, the seller knows the set of slots R; vertices of L 
appear sequentially in a random order, as advertisers bid on slots. When a vertex / G L is seen, all the edges 
from / to R are revealed, together with their weights; the seller must immediately decide whether to accept ad 
I, and if so, which of the relevant slots to place it in. The seller's goal, obviously, is to maximize his revenue. 
Subsequently, we refer to this problem as Bipartite Vertex-at-a-time Matching (BVM). We describe our results 
for BVM and other problems below. 

1.1 Results and Outline 

Recall that the elements of a transversal matroid are one partite set L (subsequently referred to as the left 
vertices) of a bipartite graph, and a set of vertices S* C L is independent if the graph constains a perfect 
matching from S to the other partite set. That is, the transversal matroid secretary problem is equivalent to the 
special case of BVM in which all edges incident to each / G L have the same weight. (Equivalently, the weights 
are on vertices of L instead of edges.) In Section |2l we give a simpler and tighter analysis for an algorithm 
essentially similar to that of Dimitrov and Plaxton ||4l for transversal matroids; this allows us to improve the 
competitive ratio from 16 to 8, even for the more general BVM problem. 

In addition to an improved ratio, our methods are of interest as they appear robust to changes in the model 
and can be naturally applied to more general problems. We illustrate this in Section |3] by extending our algo- 
rithms to hypergraph problems, with applications to more complex advertising systems in which advertisers 
desire bundles of slots, as opposed to a single slot. In particular, we obtain constant-competitive algorithms for 
finding independent edge sets in hypergraphs of constant edge-size. 

We also introduce secretary problems with groups, to model applications in which we do not see a truly 
random permutation of elements. We assume that an adversary can group the elements arbitrarily, but once 
the groups are constructed, they appear in random order. When a group appears, the algorithm can see all the 
elements in the group. We discuss this idea further in Section IH 

Finally, in Section |5l we obtain a simple 2e-competitive algorithm for the problem of finding independent 
edge-sets in graphic matroids, improving the ratio of 16 from m. 



The majority of our algorithms follow the "sample-and-price" method common to many solutions to sec- 
retary problems. That is, we look at a random sample of elements containing a constant fraction of the input, 
and use the values observed to determine prices or thresholds. In the second half, we accept an element if its 
weight/value is above the given price. For instance, in the optimal solution to the original secretary problem, 
the price is set to be the highest value seen in the first 1/e fraction of the input, and we accept any element from 
the remaining set with value greater than this price. 



2 The Bipartite Vertex-at-a-time Matching Problem 

Recall that in the BVM problem, the algorithm is initially given one partite set i? of a bipartite graph G{L U 
R, E), together with the size of the other partite set L. The algorithm sees the vertices of L sequentially, in a 
random order. When a vertex / G L is seen, all edges incident to / are revealed, together with their weights. The 
algorithm must immediately either match / to an available vertex of R, or decide that I will remain permanently 
unmatched. In this section, we show that an algorithm based on that of [4| gives a competitive ratio of 8 for this 
problem. Before presenting the algorithm for BVM, we describe a closely related algorithm Simulate that is 
easier to analyze, and then show that our final algorithm does at least as well as Simulate. 

Let Greedy denote the following greedy algorithm for the offline Edge-weighted bipartite matching problem: 



GREEDY(G(LUfi,£;)) : 
Sort edges of E in decreasing order of weight. 
Matching M ^ 

For each edge e G -E, in sorted order 
If M U e is a matching: 
M ^ MUe 
Return M. 



Let w{F) denote the weight of a set of edges F, and OPT denote the weight of an optimum (max- weight) 
matching on G. It is easy to see the following proposition, that Greedy is a 2-approximation. 

Proposition 2.1 w{M) > OPT/2. 

We now describe the algorithm Simulate, which we use purely to analyze our final algorithm for BVM. 



Simulate: 


Sort edges of G{L U R, E) in decreasing order of weight. 


Ml, Ma ^0 


Mark each vertex I e L as unassigned. 


For each edge e = (/, r) G E, in sorted order 


If / is unassigned AND Mi U e is a matching: 


Mark / as assigned 


Fhp a coin with probabihty p of heads 


If heads. Mi ^ Mi U e 


Else M2 ^ M2 U e 


M3 ^ M2 


For each vertex r e R 


If r has degree > 1 in M3 


Delete all edges incident to r from M3. 



Say that an edge e is considered by Simulate if we flip a coin and assign e to either Mi or M2. We make 
two observations about Simulate: Once any edge incident to a vertex / G L has been considered, no other 



edge incident to I will be considered later. Second, once an edge incident to r £ R has been added to Mi, no 
subsequent edge incident to r will be considered. (Note that multiple edges incident to r might be considered 
until one of these edges is added to Mi.) 

Observe that from our description of Simulate, Mi is a matching, but M2 may not be, as a vertex r £ R 
may be incident to multiple edges of M2. Hence, we have a final pruning step in case there are multiple edges 
incident to the same vertex of i?; this gives us a matching M3. We now prove three statements about Simulate, 
and later show that the matching returned by our online algorithm is at least as good as M3. 

Proposition 2.2 ¥.[w{Mi)\ > pOPT/2. 

Proof: Simulate tosses a coin (at most) once for each vertex in L; Mi is precisely the matching one would 
obtain from running Greedy on L' U R, where V denotes the vertices which came up heads. (If the coin for 
a vertex comes up tails, this vertex has no effect on Mi.) If OPT' denotes the weight of an optimum matching 
on L' U R, it is easy to see that E[OPT'] > pOPT, and hence that E[tt;(Mi)] > pOPT/2. D 

Lemma 2.3 E[u;(M2)] > (l-p)0PT/2. 

Proof: Consider any history of coin tosses in which an arbitrary edge e is being considered, and we are about 
to flip a coin to determine whether e is added to Mi or M2. Its expected contribution to Mi is pw{e), and to 
M2, is (1 — p)w{e). This holds for each edge e and any history in which e can contribute to the weight of Mi 
or M2; hence E[w{M2)] = ^^^^E[w{Mi)], completing the proof. D 

Lemma 2.4 E[w{M3)] > ^^^%^OPT. 

Proof: For each vertex v £ R, let Revenue2{v) denote the revenue earned by vertex v in M2, which we define 
as the sum of the weights of edges in M2 incident to v. (Hence, J2^ Revenue2{v) = w{M2).) For each edge 
e incident to v, let K[Revenue2{v)\e\ denote the expected revenue earned by v in M2, conditioned on the fact 
that e is the first edge incident to v selected by Simulate for M2. It is easy to see that E.[Revenue2{v)\e] < 
w{e)/p, by considering how v can earn revenue: If the next edge incident to v considered by Simulate is 
added to Mi (which happens with probability p), then v earns precisely w{e), as no later edge incident to v 
can ever be considered. In general, if v is incident to i edges in M2, the revenue it earns is at most iw{e), 
and the probability of this event is at most (1 — p)*~^ • p; this is because the next i — 1 edges incident to v 
that are considered must be added to M2, and the ith edge is added to Mi. Therefore, 'E,[Revenue2{v)\e] < 
w{e) YlZi i • P(l - pT'^ = w{e)/p. 

Similarly, for each vertex v £ R, let Revenues (v) denote the revenue earned by vertex v in M3, which is the 
weight of the (at most one) edge incident to v in M3. Let M[Revenue3 {v) \ e] denote the expected revenue earned 
by V, conditioned on e being the first edge incident to v added to M2. With probability p, the next considered 
edge incident to v is added to Mi, and hence v has degree 1 in M2. Therefore, E.[Revenues{v)\e] > pw{e), 

and so ¥.[Revenues{v)\e] > p'^¥.[Revenue2{v)\e]; it follows that E[u'(M3)] > p'^¥.[w{M2)] = ^^%^OPT. 

D 

Before describing our final algorithm for EBP, we show that the matching returned by an intermediate 
algorithm SampleAndPermute is at least as good as M3, which implies that we have a ,-.^^ ^ -competitive 
algorithm: setting p = 2/3, we get a 13.5-competitive algorithm. However, our pruning step allows us to take 
an edge for M3 only if its right endpoint has degree 1 ; a more careful pruning step allows more edges in the 
matching. We use this fact to give a tighter analysis for the next algorithm, obtaining a competitive ratio of 8. 

Note that the matching Mi in SampleAndPermute is precisely the same as Mi from Simulate; in- 
tuitively, in the former, we toss all the coins at once and run Greedy, while in the latter, we toss coins while 
constructing the Greedy Matching. (More precisely, the two algorithms to generate the matchings are equiv- 
alent.) Similarly, the "matching" M2 in this algorithm is essentially M2 from Simulate. The difference 
between the two algorithms is in the pruning step: To construct M3 in Simulate, we delete all edges incident 



SampleAndPermute(G(L U R, E)): 

For each I ^ L: 

With probability p, L' ^ L' U {1} 
Ml ^ Greedy(G[L' U R]). 
For each r ^ R: 

Set price(r) to be the weight of the edge incident to r in Mi. 
M, M2 ^ 
For each / G L — L', in random order: 

Let e = (/, r) be the highest-weight edge such that w{e) > price{r) 

AddetoM2. 

If M U e is a matching, add e to M. 



to any vertex r £ R with degree greater than 1; in SampleAndPermute, we add to M the first such edge 
seen in our permutation of L — L'. It follows immediately from Lemma [2!4l that E[ii;(M)] > p^(l — p)0PT/2, 
but accounting for the difference in pruning allows the following tighter statement, which we prove in the 
appendix. 

Lemma 2.5 E[w{M)] > ^^^OPT. 



We now present our final algorithm, a trivial modification of SampleAndPermute for the online BVM 
problem. 



SampleAndPrice(|L|, i?) 
k <— Binom{\L\,p) 
Let L' be the first k vertices of L. 
Ml ^ Greedy(G[L' U R]). 
For each r £ R: 

Set price{r) to be the weight of the edge incident to r in Mi. 
M^0 
For each subsequent I £ L — L', : 

Let e = (/, r) be the highest-weight edge such that w{e) > price{r) 

If M U e is a matching, accept e for M. 



As the input to SampleAndPrice is a random permutation, L' is a subset of L in which each vertex of 
L is selected with probability p; it is easy to see that this algorithm is equivalent to SampleAndPermute. 



Therefore, E[u;(M)] > 



p(i-p) 



OPT; setting p = 1/2 implies that the expected competitive ratio is 8. 



3 Independent Edge Sets in Hypergraphs 



In the Hypergraph Edge-at-a-time Matching (HEM) problem, we are initally given the vertex set of a hyper- 
graph; subsequently, hyperedges appear in a random order. When an edge (together with its weight) is revealed, 
the algorithm must immediately decide whether or not to accept it; as before, the goal is for the algorithm to 
select a maximum-weight set of disjoint edges. For arbitrary hypergraphs, one can observe that even the offline 
version of this problem is NP-Complete (and also hard to approximate) via an easy reduction from the Inde- 
pendent Set problem. However, the difficulty is related to the size of the hyperedges; if all edges contain only 
2 vertices, for instance, then we are simply trying to find a matching in a (possibly non-bipartite) graph. (Even 
in this special case, the problem is of interest in an online setting.) Let d denote the maximum size of an edge 
in the hypergraph. 



We provide an O(d^) -competitive algorithm for the HEM problem by solving the more general Hypergraph 
Vertex-at-a-time Matching (HVM) problem, described as follows: We are initally given a subset R of the 
vertex set of a hypergraph. The remaining vertices L arrive online; each edge of the hypergraph is constrained 
to contain exactly one vertex of L, together with some vertices of R. The vertices of L appear online in a 
random order; when / G L is revealed, the algorithm also sees all edges incident to I, together with their weight. 
At this point, the algorithm must immediately decide whether or not to accept some edge containing I, and if 
so, which edge; again, the goal is for the algorithm to select a maximum-weight set of disjoint edges. Here, 
let d denote the maximum number of vertices of R contained in a single edge (so the largest edge has d + 1 
vertices). First, we observe that the HEM problem with edge size d reduces to the HVM problem with edge 
size d+ 1: Let R be the vertex set of the original hypergraph, and add one vertex to L for each original edge. 
An edge of the new hypergraph consists of an old edge, together with the corresponding vertex of L. Clearly, 
observing a random permutation of L together with the incident edges is equivalent to a random permutation 
of the edge set of the original hypergraph. Also, notice that the the BVM problem of Section |2] is simply the 
special case of HVM when d = 1. (See Figure 1 at the end of this section.) 

These hypergraph problems capture the notion of demand bundles. For instance, in ad reservation systems, 
advertisers rarely make reservations for a single ad at a time; they are more likely to plan advertising campaigns 
involving multiple individual ads. In many campaigns, advertisers create various ads which are related to and 
complement or reinforce each other; these advertisers might be interested in acquiring a bundle or set of slots 
for this campaign. They submit to the reservation system the bundles they are interested in, together with the 
price they are willing to pay; the system must either accept a request for an entire bundle or reject it, as it 
does not receive revenue for providing the advertiser with a part of the bundle. If each advertiser submits a 
request for a single bundle, we obtain the HEM problem with vertex set corresponding to the set of slots. More 
generally, an advertiser may submit a request for one of a set of bundles, together with a price for each bundle. 
(For example, an advertiser might want an ad to appear in any three out of four local newspapers.) This leads 
to the HVM problem, with vertex set L corresponding to the set of advertisers, and set R to the set of slots: We 
receive a random permutation of advertisers, and each advertiser informs us of the bundles she is interested in, 
together with a price for each bundle. 

Let Greedy denote the offline algorithm for HVM that sorts edges in decreasing order of weight, and 
selects an edge if it is disjoint from all previously selected edges. For ease of exposition, we subsequently 
assume that the hypergraph is {d + l)-uniform; that is, that each edge contains exactly d vertices of R together 
with one vertex of L. 



Proposition 3.1 Greedy returns a {d+ 1) -approximation to the maximum-weight disjoint 
We again define an algorithm Simulate, as in Section|2j 



set. 



Sort edges of E in decreasing order of weight. 
Mark each vertex / G L as unassigned. 

For each edge e G i? in sorted order: 
Let I be the vertex of L in e 
If I is unassigned AND e is disjoint from Mi: 
Mark / as assigned. 
Flip a coin with probability p of heads 
If heads, add e to Mi 
If tails, add e to M2 
M3^0 
For each e G M2 : 

Add e to M3 if e is disjoint from the rest of M2. 



As before, we let w{F) denote the weight of an edge set F. The proofs of the following two propositions 
are exactly analogous to Proposition 12.21 and Lemma [231 

Proposition 3.2 E[w{Mi)] > p • OPT/(d + 1). 

Proposition 3.3 E[w{M2)] > {I - p)OPT / (d + 1). 

It is now slightly more complex to bound the weight of M3 than it was for the BVM problem; for BVM, 
the set of edges in M2 incident to v ^ R interfere only with each other, but in the hypergraph version, edges ei 
and 62 might not intersect, though they may both intersect 63, and hence all of ei, 62, 63 will have to be deleted. 
However, we can use a similar intuition: In BVM, we charge all edges of M2 incident to v to the heaviest such 
edge; in expectation, each edge is charged a constant number of times. For the HVM problem, we charge all 
the edges in a "connected component" to the heaviest edge in the component, and argue that (with a suitable 
choice of p) the average size of the components is small. More formally, we prove the following lemma: 



> 



OPT 



Lemma 3.4 Setting p= 1 - l/2d, E[w{M3ji ^ udid+i) - 
Proof: Construct an auxiliary directed graph F as follows: For each e G M2, add a corresponding vertex Ve to 
F. If e' is the heaviest edge in M2 that intersects e, add a directed arc from Ve to v^,' to F. (If e itself is this 
heaviest edge, Ve has no out-neighbors.) Note that the graph F is obviously a forest. For each e G M2, if Ve is 
not the root of its tree in F, we define Revenue2{e) to be 0, and if it is the root, we set Revenue2{e) to be the 
weight of all edges of M2 in the tree. Clearly, ^^ Revenue2{e) = w{M2). 

We define Revenue^{e) to be equal to the weight of e iff e is an edge in M2 that does not intersect any 
other such edge. (In which case, it follows that Ve is the root of its tree.) We prove that K[Revenue3{e)] > 

[ evenue2(e)\ ^ which provcs the lemma, since ^^ Revenue^{e) = w{M^). 

First, note that the probability that any edge e added to M2 intersects an edge added later is at most 1/2: 
For each vertex uof R contained in e, the probability that e intersects a later edge because of u is at most l/2d, 
as with probability 1 — l/2d, the next edge containing u considered by Simulate will be added to Mi. As e 
contains only d vertices in R, the desired probability is at most 1/2. (Every vertex of L is incident to at most 
one edge in M2, and so e cannot intersect any other edge through its vertex in L.) It follows that the probability 
that any v^ ^ F has a child is at most 1/2. We also count the expected number of children of Ve', the edge 
corresponding to each child of Ve must share some vertex with e, and the expected number of children through 
a particular vertex is at most X^^i ip(l — pY = (1 — p)/p. As e contains d vertices of R, the expected number 
of children of Ve is at most d{l — p)/p = 1/(2 — 1/d); since d > 2, the expected number of children is at most 
2/3. It follows that the expected size of a subtree rooted at Ve is at most 3. 

Note that Revenue2 (e) and Revenue^ (e) are both if Ve is not the root of its tree in F. Conditioned on Ve 
being a root, E[Revenue2{e)] < 3w{e), as e is the heaviest edge in its tree, and the expected size of the tree is 
at most 3. K[Revenue3{e)] is at least w{e)/2, as e intersects no previously added edges, and with probability 
at least 1/2, it intersects no edge added to M2 later. Therefore, the ratio of these expectations is at most 6, 
completing the proof. □ 

Now, we define our final algorithm S AMPLE AndPrice for the HVM problem: 



S ampleAndPrice(|L| , _R) 
k <— Binom{\L\, 1 — ■^) 
Let V be the first k elements of L. 
Ml ^ Greedy {G{L',R)). 
For each v (^ V: 

Set price{v) to be the weight of the edge incident to v in Mi. 
M^0 
For each subsequent I ^ L — L': 

Let e be the highest-weight edge containing I such that for each v ^ e, w{e) > price{v) 

If e is disjoint from M, add e to M. 



UYM{d + 1) 



HEM(<i) BVM 

1 1 

Edge-at-a-time Matchings Transversal Matroid Secretary 
(In arbitrary graphs) 

Figure 1 : Relationsiiips between the HVM problem and various special cases. 

As before, since the input is a random permutation of L, L' is a subset of L in which every vertex is selected 
independently with probability 1 — l/2d, and the matching M is at least as good as M3 from Simulate. 
Therefore, we have proved the following theorem: 

Theorem 3.5 S ampleAndPrice is an 0{cP) -competitive algorithm for the HVM secretary problem. 

Note that M may also contain extra edges that occur earlier in the permutation than edges they intersect; for 
the BVM problem, this was the difference between Lemma [2!4l and the stronger bound 1231 We do not provide 
a tighter analysis similar to Lemma 12.51 for the HVM problem in this extended abstract, nor make an attempt 
to optimize the constants of Lemma 13.41 In particular, for the HEM problem with d = 2 (finding an online 
matching in a non-bipartite graph G{V, E), given a random permutatation of E), we have a constant bound on 
the competitive ratio; a smaller constant can easily be obtained. 

4 Secretary Problems with Groups 

Consider a secretary-type problem in which, instead of receiving a random permutation of the elements, ele- 
ments can be grouped by an adversary. The algorithm receives the number of groups in advance, instead of 
the number of elements. However, once the groups have been constructed, they arrive in random order; when 
a group arrives, the algorithm can see all its elements at once. Note that the groups are fixed in advance; the 
adversary cannot construct groups in response to the algorithm's choices or the set of groups seen so far. The 
effect of such grouping on the difficulty of the problem is not immediately clear: The adversary can ensure that 
some permutations of the element set never occur, which might make the problem more difficult. On the other 
hand, as the algorithm is allowed to see several elements at once, it may be easier to compute a good solution. 

For instance, consider the classical secretary problem with groups. An optimal algorithm will never hire 
any but the best secretary from a group, and it is easy to obtain an e-competitive algorithm: Ignore all but the 
best secretary from each group, and run the standard secretary algorithm on these. That is, observe a constant 
(1/e) fraction of the groups, and note the value/price of the best secretary seen so far. From the rest of the input, 
hire the best secretary from the first group with a secretary to beat this price. Perhaps a reason this problem is 
as easy as the original version is that only one element is to be selected. 

By way of contrast, consider the following matching problem, even restricted to bipartite graphs: The 
algorithm is initially given the vertex set of a bipartite graph, and an adversary groups the edges arbitrarily. The 
groups arrive in random order; when a group arrives, the algorithm sees the weights of all edges it contains. The 
goal is to find a maximum-weight matching; note that as a special case of HEM with d = 2, we have an 0(1)- 
competitive algorithm for this problem without edge grouping. A natural S ample- And-Price algorithm for this 
problem is as follows: Look at a constant fraction of the input, and construct a matching with these edges (either 
the optimal matching, or the greedy matchings we used in the previous sections). Use the weights of edges in the 
matching to set vertex prices, and in the remainder of the input, select an edge if its weight is at least the price of 
each of its endpoints, and if it does not conflict with edges already selected. Unfortunately, this algorithm does 
not work: Consider a bipartite graph G{L U R), with L = {/i, ^2, • • • , In} and R = {ri, r2, . . . , r„}. We have 
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two groups of edges: Ei = {(/j,rj)|l < i < n}, with w{{li,ri)) = 1 + 2ie, and E2 = {(/i,rj+i)|l <i< n}, 
with i(;((/j,rj_|_i)) = 1 + (2i + l)e. Assuming e ^ 1/re^, Ei corresponds to an optimal matching, with 
weight PS n. If Ei arrives first, the price of each rj is 1 + 2ie. Subsequently, when E2 arrives, ii;((/j_i, rj)) = 
1 + {2i — l)e, and hence no edge of E2 beats the price of its right endpoint. If E2 arrives first, the price of each 
/j is 1 + {2i + l)e. Subsequently, when Ei arrives, w{{li,ri)) = 1 + 2ie, and so no edge except (/„, r„) beats 
the price of its left endpoint, for a total revenue of ps 1. 

We believe, therefore, that the introduction of groups affects these secretary-type problems in non-trivial 
ways, and these problems are likely to be of theoretical interest; in addition, they have applications to problems 
where groups occur naturally, and we do not receive a random permutation of the entire element set. To 
take another example from the advertising world, when a merchant plans a campaign, she may submit to the 
reservation system multiple ads, together with the slots in which each ad can be placed, and a price for each 
ad-slot combination. Even if the merchants arrive in a random order, this does not correspond to a random 
permutation of ads, and hence our previous analysis is not directly applicable. We model this (as in BVM) as 
an edge-weighted matching problem on a bipartite graph G{L U R, E) in which vertices of L may be grouped; 
here, the groups correspond to the set of ads for a given advertiser. The algorithm initially receives R (the set 
of slots), and the number of advertisers/groups; the adversary can construct groups from L arbitrarily. Once the 
groups have been fixed, a random permutation of the groups is seen, and when a group arrives, the algorithm 
must decide which ads to accept, and where to place them; as always, decisions are irrevocable. We refer to 
this as the BVM problem with groups. 

Theorem 4.1 There is an 0(log n)-competitive algorithm for the BVM problem with groups. 

It is easy to prove this theorem using standard techniques: Sample the first half of the vertices, and let w denote 
the weight of the heaviest edge seen so far. Pick an integer j uniformly at random in [0, 1 + [log2n]], and 
set a threshold of w/2-'. In the second half, greedily construct a matching using edges with weight above 
the threshold. (See, for instance. Theorem 3.2 of [1| for analysis of an essentially similar algorithm.) For 
completeness, we give a proof of Theorem 14. II in Section IA!2] of the appendix. 

A natural question is whether one can find a constant-competitive algorithm for BVM with groups. Note 
that one must be careful about using Sample- And-Price algorithms: First, as the example above shows, the 
natural algorithm with groups of edges instead of vertices does not work. Second, one might sample a constant 
fraction of groups, construct a matching Mi on the sampled groups, and then use Mi to set prices. However, 
once prices have been set in this way, the edges assigned to a group g may not be the same as the edges that 
would have been assigned to g in Mi if g had been sampled. This was not the case for the basic BVM problem: 
If an edge (/, r) is in M2, then by construction - fixing all other coin flips - if the coin for / had come up heads 
instead of tails, (/,r) would be in Mi. As the example in Figure [2] shows, this desirable property no longer 
holds once groups are introduced. 




Figure 2: Example for BVM with groups. Vertices A,C are in group 1, and vertex B is in group 2. Using the 
SampleAndPrice algorithm, if group 2 is sampled and group 1 is not, both edges incident to A and C beat 
their prices, and hence are added to M2. If both groups are sampled, A will be matched to X and B to Y in Mi, 
while C will remain unmatched. 

We conjecture that the following algorithm SampleWithGroups is constant-competitive for BVM with 
Groups. Here, G denotes the set of groups: 



SAMPLEWlTHGROUPS(|t?|, i?) 

Sample each group with probabiUty p. 

Construct a greedy matching Mi on the set of sampled groups Q'. 

M2 ^ 0. 

For each group ginQ — Q': 

Let E' denote the edges assigned to vertices of g in the greedy matching on Q' U g. 

M2*- M2U E'. 

M3^M2 

For each r ^ R: 

If r has degree > 1 in M3: 

Delete all edges incident to r from M3. 



It is easy to see that E[?i;(Mi)] > pOPT/2. By construction, the edges assigned to g in M2 are precisely 
those that would have been assigned to g in Mi if g had been sampled. (Hence, this algorithm differs from the 
natural SampleAndPrice.) Therefore, it follows that the probability an edge contributes to M2 is (1 — p)/p 
times the probability it contributes to Mi. If p = 1/2, it follows that E[?i;(M2)] = E[t(;(Afi)], and further, that 
the expected degree of r G i? in M2 is equal to its expected degree in Mi, which is at most 1 since Mi is a 
matching. This does not suffice to give a lower bound on the expected weight of M3, but we conjecture that the 
expected weight of M3 is at most a constant factor lower than that of M2. 

Conjecture 1 SampleWithGroups is constant-competitive for the BVM problem with groups. 

5 Graphic Matroids 

In this section, we describe a 2e-competitive algorithm for the Graphic Matroid Secretary problem. Here, we 
are initially given the set of vertices V of an undirected edge-weighted graph G = {V, E) together with the 
size of its edge set |£'|. The edges of the graph appear in a random order, and the goal is to accept a maximum- 
weight subset of edges F that does not contain any cycles. As always, the decision to accept an edge must be 
made upon its arrival, and cannot be revoked. 

This problem is equivalent to finding the maximum-weight spanning tree (assuming G is connected) and is 
also equivalent to finding the maximum-weight independent set in the graphic matroid defined by the graph G. 
Babaioff et al. [1] give a 16-competitive algorithm for the secretary version of this problem based on a related 
algorithm for transversal matroids. We give a simple reduction to the classical secretary problem, losing a 
factor of 2 in the reduction. In this way, we obtain a 2e ps 5.436-competitive algorithm for the Graphic Matroid 
Secretary problem. 

Fix an ordering vi,V2, ■ ■ ■ ,VnOn the vertices of G. Consider two directed graphs: graph Gq is obtained by 
orienting every edge of G from higher numbered to lower numbered vertex, and graph Gi by orienting every 
edge from lower to higher numbered vertex. 

Our online algorithm initially flips a fair coin X G {0, 1}. For each vertex v independently, it runs a 
secretary algorithm to find the maximum-weight edge leaving v in Gx- The output of the algorithm is F' , the 
union of all edges accepted by the individual secretary algorithms. Since the graph Gx is acyclic and each 
vertex has at most one outgoing edge, the set of edges F' must be acyclic even in the undirected sense. 

It remains to show a lower bound on the weight of F' . For each vertex v, let hx{v) be the heaviest edge 
leaving vertex v in Gx- Let Fx = {hx{v) \ v (^ V}. Let F* be a maximum-weight acyclic subgraph of G. 

Proposition 5.1 Y,v&v ^(f^oiv)) + w{hi{v)) > Y^eeF* ^(e)- 

Conditioned on the coin flip X, each secretary algorithm recovers at least 1/e fraction of the weight of the 
heaviest edge leaving its vertex. Hence E[w{F') \ X = x] = ^w{Fx) for x = 0, 1. Using Proposition 15. 1[ 
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E[w{F')] = \ {)^E\w{F') I X = 0] + \E\w{F') \X = \\)> j^w{F*). Therefore, we obtain the following 
theorem: 

Theorem 5.2 There is a 2e-competitive algorithm for the graphic matroid secretary problem. 

6 Conclusions and Open Problems 

We list several problems that remain to be solved: 

• An improved understanding of groups - and their contribution to the difficulty of secretary-type problems 
- is likely to be of interest. In particular, it may be possible to find a constant-competitive algorithm for 
the BVM problem with groups. 

• Few lower bounds for these problems are known beyond 1/e for the original secretary problem; obtaining 
such bounds may require new techniques. 

• In the basic BVM problem, we lose a factor of 2 by constructing greedy matchings. If, instead, we mod- 
ified our algorithm to set prices using an optimal matching Mi on the sampled vertices, is the resulting 
algorithm 4-competitive? Is it even 0(l)-competitive? 

• Finally, obtaining an 0(l)-competitive algorithm for the general matroid secretary problem is still open, 
though the competitive ratios for important special cases such as transversal and graphic matroids have 
been reduced to small constants. 

Acknowledgments: We would like to thank Florin Constantin, Jon Feldman, and S. Muthukrishnan for helpful 
discussions on BVM and related problems. 
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A Omitted Proofs 

A.l Proof of Lemma 12.51 

We prove Lemma [231 below, showing that SampleAndPrice is 8-competitive for the BVM problem. 

For each f G i?, we let Revenue2{v) be the revenue earned by v in M2, which is the total weight of edges 
in M2 incident to v. Similarly, Revenue3{v) denotes the weight of the (at most one) edge of M3 incident 
to V. Let Pi be the probability that v is incident to i edges in M2. Finally, we let ¥,[Revenue2{v)\i] and 
K[Revenues{v)\i] be the expected revenue earned by v in M2 and M3 respectively, conditioned on v being 
incident to i edges in M2. 

First, we note that 'E[Revenue3{v)\i] = evenue2(v)\i\ ^ ^^ ^^^ ^^^^ ^^^ of coin flips in which v has degree i 
in M2, we may see any of the i edges incident to v first in the random permutation; on average, then, we receive 
a l/i fraction of Revenue2{v). We then have the following equations: 



¥.[Revenue2{v)] = / ^Pj ■'&[Revenue2{v)\i]- (1) 

For ease of notation below, we use Wi to denote 'E[Revenue2{v)\i]. We wish to bound 'K[Revenue'j,{v)] in 
terms of E[i?efenue2(f )], and we do this as follows: First, we show that Pi < (1 — p)Pj_i, and Wi < j^Wi-i. 
Next, we prove that subject to these constraints, the worst-case ratio of these two expectations occurs when 
all the constraints hold with equality. We can then evaluate the sums, and show that E,[Revenues{v)] > 
pE[Revenue2{v)], completing our proof. 

It is easy to see that Wi < j^Wi-i; consider any partial history of Simulate in which i — 1 edges incident 
to V have been added to M2 so far; as we process edges in decreasing order of weight, the zth edge must 
be the lightest of those seen so far. As this is true for each (partial) history, it holds in expectation, and so 
Wi < Wi-i + TTT". Similarly, to see that Pi < {I — p)Pi-i, consider a partial history until the (i — l)st edge 
has just been added: M2 will have i — 1 edges incident to v if the coin for the next edge incident to v considered 
by Simulate comes up "heads", with probability p. M2 will have i edges incident to v if the coin for the next 
edge incident to v comes up "tails", and that for the following edge comes up heads, with probability (1 — p) -pu 
Again, as this holds for each history, we have Pj < (1 — p)Pi-i. 

To see that the worst-case ratio occurs when all these constraints hold with equality, notice that the ratio 
between successive terms of Equations (1) and (2) is increasing: The ratio between the ith terms is simply 
i. Let a denote the worst-case ratio of the expectations; from Lemma 1241 we already know that a < p^. If 
j = [1/aJ, for 1 < z < j, the ratio between the ith term of the two sums is at most a, while for i > j, the 
ratio is greater than a. Consider a choice of lUj's and Pj's such that the ratio between (1) and (2) be as large as 
possible, and suppose the constraints on Pi and Wi do not all hold with equality. Let k be an index such that 
Pfc < (1 — p)Pk-i or Wk < -j^Wk-i- If k > j, then by increasing P^ or Wk, we do not violate any constraint, 
and the increase in (1) is greater than a times the increase in (2). Similarly, if fc < j, by decreasing Pk-i or 
Wk-i to achieve equality, and also decreasing Pi . . . Pk-2 ox wi . . . Wk-2 to maintain feasibility, the decrease 
in (1) is less than a times the decrease in (2). In either of these situations, we increase the ratio between the two 
sums, contradicting our initial setting of Wi,Pi. 



'it is possible that there is only one more edge incident to n, in which case v will have i edges with probability (1 — p). However, 
this only helps the analysis. Alternatively, one can assume the existence of a large number of "zero-weight" edges incident to v. 
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Finally, we can now evaluate this worst case ratio. Setting Wi = j^Wi and Pj = (1 — p)Pi-i, we find: 

oo 
i=\ 

00 
E[i?et'eniie3(f)] = y ^w\P\(\ — pY^^ = wiPi/p = pElRevenue2{v)] 



i=l 



As Y.V HRevenue2iv)] = E[w{M2)] > (1 - p)0PT/2, we have EiwiMs)] > p{l - p)0PT/2, completing 
the proof of Lemma [231 

A.2 Other Proofs 

Proof of Theorem 14.11 We show that the algorithm of Theorem 14. 1 1 is 0(logn)-competitive for BVM with 
groups, closely following the analysis of 1 1] for an 0(log /c) -competitive algorithm for general matroids. Recall 
that the algorithm observes the first half of the vertices, and picks a random integer j G [0,1 + [log n] ]. If w 
is the weight of the heaviest edge seen so far, the algorithm sets a threshold of w/2^ , and in the second half, 
greedily constructs a matching using edges of weight greater than this threshold. 

Let OPT be an optimal matching; we also abuse notation and use OPT to refer to the weight of this 
matching, though the meaning will be clear from context. Let wi,W2, ■ ■ -Wk denote the weights of edges in 
OPT, such that Wi > Wj+i for 1 < i < A;. Let q denote the largest index in [1, k] such that Wq > wi/n. Clearly, 
Yl'i=i '^i > OPT/2, as the remaining edges all have weight less than wi/n, and there are fewer than n of them. 
For any set of edges F, we use ni{F) to denote the number of edges in F with weight at least Wi, and mi{F) 
to denote the number of edges in F with weight at least Wi/2. Now, we have: 

X]^i= \Y,^^iOPT){w, - Wi+i)\ +ng{OPT)wg 

i=l \i=l / 

Let M be the matching returned by our algorithm. We lower bound the weight of M as follows: 

mq{M)wg 



1 f"'^ 
;(M)>- 5^m,(M)( 

Vi=l 



w{M)>-\2^mi{M){wi-w,+i)\ + 

In order to obtain an 0(logn)-competitive algorithm, it suffices to show that for each I < i < q, 
E[mj(M)] > nj(OPT)/0(logn). First, consider the case of i = 1: ni(OPT) = 1, and we argue that 
E[mi(M)] > l/4([logn] + 1). With probability 1/4, the vertex v incident to the heaviest edge appears in 
the second half, and the heaviest edge not incident to any vertex of v's group appears in the first half. If this 
occurs, and the algorithm picks j = (which happens with probability l/([logn] + 1)), then the only edges 
with weight above the threshold are those incident to vertices in u's group. Thereore, the greedy algorithm will 
select the heaviest edge with probability l/4( [log n\ +1), and hence E[?7ii (M)] > l/4( [log n] +1). 

We now complete the argument for each i > 1. Let v be the vertex incident to the heaviest edge. We 
consider two cases: First, that at least half the edges of OPT with weight at least Wi are incident to vertices not 
in the same group as v, and second, that more than half these edges are incident to vertices of v's group. 

In the former case, suppose that v is seen in the first half. Let w be the weight of this heaviest edge, and let 
i' be the smallest integer in [0, 1 + [logn]] such that w/2* < WiO With probability ,r^^ n1+i) ' ^^^ algorithm 
picks j = i', and the threshold is set to be tt;/2* > Wi/2. Let X denote the event that the threshold is set to be 



Note that w may be greater than wi, as the heaviest edge may not be in opt. However, it is easy to see that w < 2wi, and since 
> wi/n, there always exists such an index i' . 
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it)/2* ; as we have seen, Pr[X] > l/2( [log n] + 1). We show that conditioned on X, E[?7ij(M)] is sufficiently 
large. 

Recall that OPT contains a matching of size i using edges of weight at least Wi ; it follows that in expectation, 
using edges of this weight, there is a matching in the second half of size at least i/A. (This is because at least 
half of these i edges are in other groups; even conditioned on v appearing in the first half, each of the remaining 
> i/2 edges could appear in either half.) Since we construct a greedy matching using edges of weight at 
least Wi/2, the expected size of this matching is at least i/8. Hence, with probability at least WTio^nV+i)' 
E[mi{M)] > i/8. That is, E [mi (M)] > i/16([logn] + 1). 

We now consider the second case, when more than half the edges of OPT with weight at least wi are in 
the same group as v. Let u be the vertex outside this group incident to the heaviest-weight edge. Suppose u's 
group appears in the first half, and w's group in the second. Let w be the weight of the heaviest edge incident 
to u;if w < Wi and we pick j = 0, the only edges above the threshold will be vertices in f 's group. Since we 
construct the greedy matching using only the group of v, and there exists a matching in this group with more 
than i/2 edges of weight Wi, the matching we construct has at least z/4 edges of weight at least Wi. If tt; > wi, 
then with probability l/[log n] + 1, we pick an index j such that Wi > w/2^ > Wi/2. Again, we will find a 
matching in which at least i/4 edges have weight at least Wi/2. Therefore, with probability at least 4(no n+iiv 
we find a matching of size at least i/4. Therefore, E[mj(M)] > i/l(i{ [log n\ +1). 

Therefore, we have E[^(M)] > ^^^^^^^1^^^^ J2ti ^^ > OPT/64( [log n] + 1). D 

Proof of Proposition UTIJ Let h{v) denote the heaviest edge incident to v; clearly J2v "^(^ol^)) +w{hi{v)) > 

X^t, w{h{v)). It remains to show that this latter sum is at least Yl^^p* w{e). To see this, consider the tree F*, 
and root it arbitrarily. For each edge e = (n, v) G F*, the weight of e is at most h{v), where v is the vertex 
further from the root. Each vertex v is charged by at most one edge, and so J^.^ w{h{v)) > J2e€F* '^(s)- '-' 
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